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Abstract 



We show that the rank of a depth-3 circuit (over any field) that is simple, min- 
imal and zero is at most (9(fc 3 logd). The previous best rank bound known was 
2°( fe2 )(logd)' £ - 2 by Dvir and Shpilka (STOC 2005). This almost resolves the rank 
question first posed by Dvir and Shpilka (as we also provide a simple and minimal 
identity of rank fl(klogd)). 

Our rank bound significantly improves (dependence on k exponentially reduced) 
the best known deterministic black-box identity tests for depth-3 circuits by Karnin 
and Shpilka (CCC 2008). Our techniques also shed light on the factorization pattern 
of nonzero depth-3 circuits, most strikingly: the rank of linear factors of a simple, 
minimal and nonzero depth-3 circuit (over any field) is at most 0(k 3 log d). 

The novel feature of this work is a new notion of maps between sets of linear forms, 
called ideal matchings, used to study depth-3 circuits. We prove interesting structural 
results about depth-3 identities using these techniques. We believe that these can lead 
to the goal of a deterministic polynomial time identity test for these circuits. 

1 Introduction 

Polynomial identity testing (PIT) ranks as one of the most important open problems in the 
intersection of algebra and computer science. We are provided an arithmetic circuit that 
computes a polynomial p{x\ , X2, • • • , x n ) over a field F, and we wish to test if p is identically 
zero. In the black-box setting, the circuit is provided as a black-box and we are only 
allowed to evaluate the polynomial p at various domain points. The main goal is to devise 
a deterministic polynomial time algorithm for PIT. Kabanets and Impagliazzo [KI04] and 
Agrawal |Agr05] have shown connections between deterministic algorithms for identity 
testing and circuit lower bounds, emphasizing the importance of this problem. 

The first randomized polynomial time PIT algorithm, which was a black-box algorithm, 
was given (independently) by Schwartz [Sch80j and Zippel |Zip79]. Randomized algorithms 
that use less randomness were given by Chen & Kao [CKOOj . Lewin & Vadhan [LV98j. 
and Agrawal & Biswas [AB03J. Klivans and Spielman [KS01] observed that even for 
depth-3 circuits for bounded top fanin, deterministic identity testing was open. Progress 
towards this was first made by Dvir and Shpilka [DS06J, who gave a quasi-polynomial 
time algorithm, although with a doubly-exponential dependence on the top fanin. The 
problem was resolved by a polynomial time algorithm given by Kayal and Saxena [KS07J, 
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with a running time exponential in the top fanin. For a special case of depth-4 circuits, 
Saxena [Sax08j has designed a deterministic polynomial time algorithm for PIT. Why is 
progress restricted to small depth circuits? Agrawal and Vinay |AV08j recently showed 
that an efficient black-box identity test for depth-4 circuits will actually give a quasi- 
polynomial black-box test for circuits of all depths. 

For deterministic black-box testing, the first results were given by Karnin and Sh- 
pilka [KS08J. Based on results in [DS06j , they gave an algorithm for depth-3 circuits 
having a quasi-polynomial running time (with a doubly-exponential dependence on the 
top faninj^j. One of the consequences of our result will be a significant improvement in 
the running time of their deterministic black-box tester. 

This work focuses on depth-3 circuits. A structural study of depth-3 identities was 
initiated in l)S()(ij by defining a notion of rank of simple and minimal identities. A depth-3 
circuit C over a field F is: 

k 

C(x\, . . . , x n ) = ^ Ti 

i=l 

where, Ti (a multiplication term) is a product of d{ linear functions lij over F. Note 
that for the purposes of studying identities we can assume wlog (by homogenization) that 
iij's are linear forms (i.e. linear polynomials with a zero constant coefficient) and that 
d\ = • • • = dk ='■ d. Such a circuit is referred to as a SIIS(fc, d) circuit, where k is the top 
fanin of C and d is the degree of C. We give a few definitions from [DS06J. 

Definition 1. [Simple Circuits] C is a simple circuit if there is no nonzero linear form 
dividing all the Ti 's. 

[Minimal Circuits] C is a minimal circuit if for every proper subset S C [k], Y^ieS 
is nonzero. 

[Rank of a circuit] The rank of the circuit, rank(C), is defined as the rank of the 
linear forms lij 's viewed as n-dimensional vectors over F. 

Can all the forms li be independent, or must there be relations between them? The 
rank can be interpreted as the minimum number of variables that are required to express 
C. There exists a linear transformation converting the n variables of the circuit into 
rank{C) independent variables. A trivial bound on the rank (for any SllS-circuit) is kd, 
since that is the total number of linear forms involved in C. The rank is a fundamental 
property of a T,UT,(k,d) circuit and it is crucial to understand how large this can be for 
identities. A substantially smaller rank bound than kd shows that identities do not have 
as many "degrees of freedom" as general circuits, and lead to deterministic identity tests^l 
Furthermore, the techniques used to prove rank bounds show us structural properties of 
identities that may suggest directions to resolve PIT for SIIS(A;, d) circuits. 

Dvir and Shplika [DS06J proved that the rank is bounded by 2°( k ' (log d) k ~ 2 , and this 
bound is translated to a pory(n)exp(2 ( fc2 )(logd) fc ~ 1 ) time black-box identity tester by 
Karnin and Shpilka [KS08J. Note that when k is larger than y/log d, these bounds are 
trivial. 

Our present understanding of Y,UY,(k,d) identities is very poor when k is larger than 
a constant. We present the first result in this direction. 

1 |KS08| had a better running time for read-fc depth-3 circuits, where each variable appears at most k 
times. But even there the dependence on k is doubly-exponential. 
2 We usually do not get a polynomial time algorithm. 
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Theorem 2 (Main Theorem). The rank of a simple and minimal £LT£(A;,d) identity is 
0(k 3 logd). 

This gives an exponential improvement on the previously known dependence on k, and 
is strictly better than the previous rank bound for every k > 3. We also give a simple 
construction of identities with rank Q(klogd) in Section [21 showing that the above theo- 
rem is almost optimal. As mentioned above, we can interpret this bound as saying that 
any simple and minimal EIIE(fc, d) identity can be expressed using 0(fc 3 log<i) indepen- 
dent variables. One of the most interesting features of this result is a novel technique 
developed to study depth-3 circuits. We introduce the concepts of ideal matchings and 
ordered matchings, that allow us to analyze the structure of depth-3 identities. These 
matchings are studied in detail to get the rank bound. Along the way we initiate a theory 
of matchings, viewing a matching as a fundamental map between sets of linear forms. 

Why are the simplicity and minimality restrictions required? Take the non-simple 
SIIS(2, d) identity (x\X2 • • • xj) — {x\X2 ■ ■ ■ Xd). This has rank d. Similarly, we can take 
the non-minimal SIIS(4, d + 1) identity (2/12/2 ■ ■ ■ yd){ x i ~ x i) + ' • • z d)( x 2 — x 2) that 
has rank (2d+2). In some sense, these restrictions only ignore identities that are composed 
of smaller identities. 

1.1 Consequences 

Apart from being an interesting structural result about SIIS identities, we can use the 
rank bound to get nice algorithmic results. Our rank bound immediately gives faster 
deterministic black-box identity testers for SIIS(fc, d) circuits. A direct application of 
Lemma 4.10 in [KS08J to our rank bound gives an exponential improvement in the 
dependence of k compared to previous black-box testers (that had a running time of 
poly(n)exp(2°( A;2 )(log d)* -1 )). 

Theorem 3. There is a deterministic black-box identity tester for SIIS(fc, d) circuits that 
runs in poly(n,d k lo s d ) time. 

The above black-box tester is now much closer in complexity to the best non black-box 
tester known (poly(n,d k ) time by [KS07] ). 

Our result also applies to black-box identity testing of read-k SIIS(fe, d) circuits, where 
each variable occurs at most k times. We get a similar immediate improvement in the 

dependence of k (the previous running time was n 2 ° (fc .) 

Theorem 4. There is a deterministic black-box identity tester for read-k EIIE(fc, d) cir- 
cuits that runs in 0(n k lo s fc ) time. 

Although it is not immediate from Theorem [21 our technique also provides an in- 
teresting algebraic result about polynomials computed by simple, minimal, and nonzero 
EUE(k,d) circuital. Consider such a circuit C that computes a polynomial p{x\, ■ ■ ■ ,x n ). 
Let us factorize p into n^i' where each qi is a nonconstant and irreducible polynomial. 
We denote by L{p) the set of linear factors of p (that is, qi G L(p) iff qi\p is linear). 

Theorem 5. If p is computed by a simple, minimal, nonzero EIIE(A;,d) circuit then the 
rank of L(p) is at most A; 3 logd. 

3 Here we can also consider circuits where the different terms in C have different degrees. The parameter 
d is then an upper bound on the degree of C. 
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1.2 Organization 



We first give a simple construction of identities with rank Q(k log d) in Section [2j Section [3] 
contains the proof of our main theorem. We give some preliminary notation in Section r3.ll 
before explaining an intuitive picture of our ideas (Section 13. 2h . We then explain our main 
tool of ideal matchings (Section 13. 3D and prove some useful lemmas about them. We move 
to Section 13.41 where the concepts of ordered matchings and simple parts of circuits are 
introduced. We motivate these definitions and then prove some easy facts about them. 
We are now ready to tackle the problem of bounding the rank. We describe our proof in 
terms of an iterative procedure in Section T3.5I Everything is put together in Section T3.6I to 
bound the rank. Finally (it should hopefully be obvious by then), we show how to apply 
our techniques to prove Theorem El 



2 High Rank Identities 

The following identity was constructed in [KS07] : over F2 (with r ^ 2), 

C(xi, . . . ,x r ) := Yl (hxi-\ h fcr-iav-i) 

6i,...,6 r -ieF 2 
6i+-+6 r _i=l 

+ Yl ( Xr + 

b±xi + • • • + b r ^\x r ^i) 

&i,...,fe r _ieF 2 
6i+-+6 r -isO 

+ Yl {x r + bixi~\ h b r -ix r -i) 

6i,...,fe r _ieF 2 

bi+-+b r -l=l 

It was shown that, over F2, C is a simple and minimal SITE zero circuit of degree d = 2 r ~ 2 
with k = 3 multiplication terms and rank(C) = r = log 2 d + 2. For this section let Si(x), 
S2(x), Ss(x) denote the three multiplication terms of C. We now build a high rank identity 
based on Si, S2, S3 . Our basic step is given by the following lemma that was used in [DS06J 
to construct identities of rank (3k — 2). 

Lemma 6. WS06I Let D^y^i, . . . , y^rj := Y2j=i Tj be a simple, minimal and zero SITS 
circuit, over ¥2, with degree di, fanin k{ and rank r{. Define a new circuit over ¥2 using 
Di and C: 

/ki-l \ 

A+l(j/i,l, • • • ,yi,n+r) '■= j T i I ' ^{Vi,n+li ■ ■ ■ >2/i,n+r) — • 5*2 (2/2,^+1 , • • • ,yi,n+r) 

\j=l J 

— ■ Sz{yi,n+u ■ ■ ■ > Vi,n+r) 

Then Di+i is a simple, minimal and zero SIIS circuit with degree di+i = (di + d), fanin 
k i+ i = (hi + 1) and rank r i+ i = (rj + r). 

Proof. Since C is an identity, we get that S 2 (yi, n +i, • • • , yi, n +r) + S 3 (y itn +i, ■■■ , yi, n +r) = 
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Si(yi, n +i, ■ ■ .,yi,n+r)- Therefore, 

A+l(2/i,l, • • • ,Vi,ri+r) 
ki-1 

= ( ^ Tj)Si(yi >n +i, . . . ,yi,r t +r) - T k . (S 2 (yi, n +i, ■ ■ ■ ,yi,r t +r) + £3(2/1,^+1, • • • ,yi,n+r)) 
i=i 
fci-i 

= ( ^ ' • • • >2/i,fi+r) + TfejS'i^j^+i, . . . ,yi, n +r) 

3=1 

K 

= (^2 T j) ' 5 1 (f i,ri+l . ■ ■ ■ . I/i,ri+r ) = 

The terms Tj do not share any variables with Sg (£ £ {1,2,3}). Since Di and C 
are simple, A+i is also simple. Suppose A+i is not minimal. We have some subset 
P C [1, such that C" := {^2j e pTj)Si — a 2 T ki S2 — a^T ki S^ = 0, where a 2 ,a 3 G {0, 1}. 

If both a 2 and 03 are 1, then we get (%2jeP Tj)Si + T k .S\ = 0, now P must be the whole 
set [l,ki — 1], because Di is minimal. On the other hand, if both 02,03 are 0, then 
(J2j£pTj)S\ = which is impossible as Di is minimal. The only remaining possibility 
is (wlog) {YljepTj)Si — T ki S 2 = 0. As S\ is coprime to 5 2 and T k ., this is impossible. 
Therefore, A+i is minimal. 

It is easy to see the parameters of A+i : = (&i + 1) an d dj+i = (di + 1). Because 
the Tj's do not share any variables with the rank rj+i = (r» + r). □ 

Family of High Rank Identities: Now we will start with Dq := C(yo,i, • • • ,2/o,r) 
and apply the above lemma iteratively. The i-th circuit we get is Di with degree di = 
(i + l)d, fanin fej = i + 3 and rank = (i + l)r = (i + l)(log 2 d + 2). So r« relates to fej, di 
as: 

ri = - 2) (log 2 + 2^ . 

Also it can be seen that if d > i then > \fd~i- Thus after simplification, we have for 
any 3 < i < d, ri > ^ • log 2 di- This gives us an infinite family of SnS(A;, d) identities 
over F 2 with rank f2(A;logd). A similar family can be obtained over F3 as well. 

3 Rank Bound 

Our technique to bound the rank of SIIE identities relies mainly on two notions - form- 
ideals and matchings by them - that occur naturally in studying a SITS circuit C. Using 
these tools we can do a surgery on the circuit C and extract out smaller circuits and 
smaller identities. Before explaining our basic idea we need to develop a small theory of 
matchings and define gcd and simple parts of a subcircuit in that framework. 

We set down some preliminary definitions before giving an imprecise, yet intuitive 
explanation of our idea and an overall picture of how we bound the rank. 

3.1 Preliminaries 

We will denote the set { 1 , . . . , n} by [n\. 
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In this paper we will study identities over a field F. So the circuits compute multivariate 
polynomials in the polynomial ring R := ¥[xi, . . . ,x n ]. We will be studying T l UT,(k,d) 
circuits : such a circuit C is an expression in R given by a depth-3 circuit, with the top gate 
being an addition gate, the second level having multiplication gates, the last level having 
addition gates, and the leaves being variables. The edges of the circuit have elements of F 
(constants) associated with them (signifying multiplication by a constant). The top fanin 
is k and d is the degree of the polynomial computed by C. We will call C a TITE-identity, 
if C is an identically zero SllS-circuit. 

A linear form is a linear polynomial in R. We will denote the set of all linear forms 



Much of what we do shall deal with sets of linear forms, and various maps between them. 
A list L of linear forms is a multi-set of forms with an arbitrary order associated with 
them. The actual ordering is unimportant : we merely have it to distinguish between 
repeated forms in the list. One of the fundamental constructs we use are maps between 
lists, which could have many copies of the same form. The ordering allows us to define 
these maps unambiguously. All lists we consider will be finite. 

Definition 7. [Multiplication term] A multiplication term / is an expression in R 
given as (the product may have repeated £'s): 



The list of linear forms in /, L{f), is just the list S of forms occurring in the product 
above. #L(f) is naturally called the degree of the multiplication term. For a list S of 
linear forms we define the multiplication term of S, M(S), as Yli € g£ or 1 if S = <fi. 

Definition 8. [Forms in a Circuit] We will represent a T,UT,(k,d) circuit C as a sum 



rank(C), is just the number of linearly independent linear forms in L(C). 
3.2 Intuition 

We set the scene, for proving the rank bound of a SIIS(A;, d) identity, by giving a combi- 
natorial/graphical picture to keep in mind. Our circuits consist of k multiplication terms, 
and each term is a product of d linear forms. Think of there being A; groups of d nodes, so 
each node corresponds to a form and each group represents a terrrQ. We will incrementally 
construct a small basis for all these forms. This process will be described as some kind of 
a coloring procedure. 

At any intermediate stage, we have a partial basis of forms. These are all linearly 
independent, and the corresponding nodes (we will use node and form interchangeably) 
are colored red. Forms not in the basis that are linear combinations of the basis forms 
(and are therefore in the span of the basis) are colored green. Once all the forms are 

4 A form that appears many times corresponds to that many nodes. 



by L{R) : 




f := c ' 11 ^' where c G F* and S is a list of linear fc 



orms. 
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colored, either green or red, all the red forms form a basis of all forms. The number of 
red forms is the rank of the circuit. When we have a partial basis, we carefully choose 
some uncolored forms and color them red (add them to the basis). As a result, some other 
forms get "automatically" colored green (they get added to the span). We "pay" only for 
the red forms, and we would like to get many green forms for "free". Note that we are 
trying to prove that the rank is k°^ log d, when the total number of forms is kd. Roughly 
speaking, for every forms we color red, we need to show that the number of green 

forms will double. 

So far nothing ingenious has been done. Nonetheless, this image of coloring forms is 
very useful to get an intuitive and clear idea of how the proof works. The main challenge 
comes in choosing the right forms to color red. Once that is done, how do we keep 
an accurate count on the forms that get colored green? One of the main conceptual 
contributions of this work is the idea of matchings, which aid us in these tasks. Let us 
start from a trivial example. Suppose we have two terms that sum to zero, i.e. T\ +T2 = 0. 
By unique factorization of polynomials, for every form i £ T\, there is a unique form 
m 6 T2 such that £ = cm, where c G F* (we will denote this by £ ~ to). By associating the 
forms in T\ to those in T2, we create a matching between the forms in these two groups 
(or terms). This rather simple observation is the starting point for the construction of 
matchings. 

Let us now move to k = 3, so we have a simple circuit C = T\ + T2 + T3 = 0. Therefore, 
there are no common factors in the terms. To get matchings, we will look at C modulo 
some forms in T3. By looking at C modulo various forms in T3, we reduce the fanin of C 
and get many matchings. Then we can deduce structural results about C. Similar ideas 
were used by Dvir and Shpilka [DS06] for their rank bound. Taking a form g G T3, we 
look at C(mod q) which gives T\ +T2 = 0(mod q). By unique factorization of polynomials 
modulo q, we get a q-matching. Suppose (£, to) is an edge in this matching. In terms of 
the coloring procedure, this means that if q is colored and £ gets colored, then m must also 
be colored. At some intermediate stage of the coloring, let us choose an uncolored form 
q £ T3. A key structural lemma that we will prove is that in the q-matching (between 
Xi and T2) any neighbor of a colored form must be uncolored. This crucially requires the 
simplicity of C. We will color q red, and thus all neighbors of the colored forms in T\ U T2 
will be colored green. By coloring q red, we can double the number of colored forms. It 
is the various matchings (combined with the above property) that allow us to show an 
exponential growth in the colored forms as forms in T3 are colored red. By continuing this 
process, we can color all forms by coloring at most O(logcf) forms. Quite surprisingly, the 
above verbal argument can be formalized easily to prove that rank of a minimal, simple 
circuit with top fanin 3 is at most (log 2 d + 2). For this case of k = 3, the logarithmic rank 
bound was there in a lemma of Dvir and Shpilka [DS06J, though they did not present the 
proof idea in this form, in particular, their rank bound grew to (logc?) 2 for k = 4. 

The major difficulty arises when we try to push these arguments for higher values of 
k. In essence, the ideas are the same, but there are many technical and conceptual issues 
that arise. Let us go to k = 4. The first attempt is to take a form q £ T4 and look at 
C(mod q) as a fanin 3 circuit. Can we now simply apply the above argument recursively, 
and cover all the forms in T\ U T2 U T3? No, the possible lack of simplicity in C(mod q) 
blocks this simple idea. It may be the case that T\,T<i and T3 have no common factors, 
but once we go modulo q, there could be many common factors! (For example, let q = x\. 



7 



Modulo q, the forms x\ + X2 and X2 would be common factors.) 

Instead of doing things recursively (both |DS06] and [KS07| used recursive arguments) , 
we look at generating matchings iteratively. By performing a careful iterative analysis that 
keeps track of many relations between the linear forms we achieve a stronger bound for 
k > 3. We start with a form £\ G T\, and look at C(mod £\). From C(mod £\), we remove 
all common factors. This common factor part we shall refer to as the gcd of C(mod £±), 
the removal of which leaves the simple part of C(mod £\). Now, we choose an appropriate 
form £2 from the simple part, and look at C(mod ^1,^2)- We now choose an £3 and so on 
and so forth. For each £ that we choose, we decrease the top fanin by at least 1, so we will 
end up with a matching modulo the ideal (£±,£2, • ■•,£r), where r < (k — 2). We call these 
special ideals form ideals (as they are generated by forms), and the main structures that 
we find are matchings modulo form ideals. The coloring procedure will color the forms 
in the form ideal red. Of course, it's not as simple as the case of k = 3, since, for one 
thing, we have to deal with the simple and gcd parts. Many other problems arise, but we 
will explain them as and when we see them. For now, it suffices to understand the overall 
picture and the concept of matchings among the linear forms in C. 

We now start by setting some notation and giving some key definitions. 

3.3 Ideal Matchings 

We will use the concept of ideal matchings to develop tools to prove Theorem El In this 
subsection, we provide the necessary definitions and prove some basic facts about these 
matchings. 

First, we discuss similarity between forms and form ideals. 
Definition 9. We give several definitions : 

• [Similar forms] For any two polynomials f,g£Rwe call f similar to g if there is 
a c G F* such that f = eg. We say f is similar to g mod /, for some ideal I of R, if 
there is a c G F* such that f = eg (mod I). We also denote this by f ~ g (mod I) 
or f is I-similar to g. 

• [Similar lists] Let Si = (a±, . . . , ad) and S2 = (61, ... , bd) be two lists of linear 
forms with a bisection tt between them. Si and S2 are called similar under tt if for 
all i G [d], ai is similar to n(ai). Any two lists of linear forms are called similar if 
there exists such a tt. Empty lists of linear forms are similar vacuously. For any 
£ G L(R) we define the list of forms in Si similar to £ as the following list (unique 
upto ordering): 

simi(£, Si) := (a G S\ \ a is similar to £) 
We call S±, S2 coprime lists if \/£ G Si, #simi(£, S2) = 0. 

• [Form-ideal] A form-ideal / is the ideal (I) of R generated by some nonempty 
I C L(R). Note that if I = {0} then a = b(mod I) simply means that a = b 
absolutely. 

• [Span sp(S)] For any S C L(R) we let sp(S) C L(R) be the linear span of the linear 
forms in S over the field F. 
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• [Orthogonal sets of forms] Let Si,..., S m be sets of linear forms for m>2. We 
call Si, ... , S m orthogonal if for all m' G [m — 1]: 

sp{ (J 5j) n sp(S m , +1 ) = {0} 

j£[m'] 

Similarly, we can define orthogonality of form-ideals I\,...,I m . 

We give a few simple facts based on these definitions. It will be helpful to have these 
explicitly stated. 

Fact 10. Let U, V be lists of linear forms and I be a form-ideal. If U, V are similar then 
their sublists U' := (£ G U \ I G sp(I)) and V' := {£ G V \ £ G sp(I)) are also similar. 

Proof. If £/, V are similar then for some c G F*, M(F) = cM(U). This implies: 

M(V') • M(V \ V") = cM(C/') • M(f7 \ 17') 

Since elements of U\U' are not in sp(I), for any f £ F', f does not divide M({7 \ f/'). In 
other words M(V') divides M(U'), and vice versa. Thus, M(U'), M(V') are similar and 
hence by unique factorization in R, lists U' , V are similar. □ 

Fact 11. Let li,li be two orthogonal form-ideals of R and let D be a T,UT,(k,d) circuit 
such that L(D) has all its linear forms in sp(Ii). If D = (mod I2) then D = 0. 

Proof. As Ii , I2 are orthogonal we can assume Ii to be {£1 , ■ ■ ■ , £ m } and I2 to be {£[ , ■ ■ ■ , £' m , } 
where the ordered set V := {£1, . . . ,£ m , £\, . . . ,£' m /} has (m + ml) linearly independent 
linear forms. Clearly, there exists an invertible linear transformation r on sp({xi, . . . , x n }) 
that maps the elements of V bijectively, in that order, to xi, . . . ,x m+m i. On applying r 
to the equation D = (mod I2) we get: 

t(D) = (mod x m +i, . . .,x m+m >), where t(D) € ¥[x 1 , . . .,x m \. 

Obviously, this means that t(D) = which by the invertibility of r implies D = 0. □ 

We now come to the most important definition of this section. We motivated the 
notion of ideal matchings in the intuition section. Thinking of two lists of linear forms as 
two sets of vertices, a matching between them signifies some linear relationship between 
the forms modulo a form-ideal. 

Definition 12. [Ideal matchings] Let U, V be lists of linear forms and I be a form-ideal. 
An ideal matching n between U, V by I is a bijection it between lists U, V such that: for 
all £ £ U, tt(£) = c£ + v for some c £ F* and v G sp(I). The matching tt is called trivial 
if U, V are similar. 

Note that tt being a bijection and c being nonzero together imply that 7r _1 can also 
be viewed as a matching between V, U by I. We will also use the terminology I -matching 
between U and V for the above. Similarly, an I -matching tt between multiplication terms 
f,g is the one that matches L(f),L(g). (For convenience, we will just say "matching" 
instead of "ideal matching".) 

The following is an easy fact about matchings. 
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Fact 13. Let tt be a matching between lists of linear forms U,V by I and let U' C U , 
V C V be similar sublists. Then there exists a matching tt' between U,V by I such that: 
U' , V are similar under tt' . 

Proof. Let £' G U' be such that %(£') = d'l' + v' (for some d' G F* and v' G sp(I)) is not 
in V or is not similar to £'. As V is similar to V there exists a form equal to a£' in V', 
for some a G F*, and tt being a matching must be mapping some £ G U to a^" in V. Also 
from the matching condition there must be some d G F* and v G sp(7) such that tt(£) = 
d£ + v = a£'. 

Now we define a new matching tt by flipping the images of £ and £' under tt, i.e., define 

7? to be the same as tt on U \ {£,£'} and: tt(£) : = vr(£') and vr(^') : = tt{£). Note that 7r 
inherits the bijection property from tt and it is an /-matching because: tt{£') = a£' for 
a G F* and more importantly, 

n(£) = tt{£>) = d'£' + v' = d> (*±^) W = W + i/) 

The form (^- + z/) is clearly in sp(I). Thus, we have obtained now a matching tt 
between U, V by I such that the £' G L7 7 is similar to tt(£') G V. 

Note that we increased the number of forms in U' that are matched to similar forms 
in V' . If we find another form in U' that is not matched to a similar form in V , we can 
just repeat the above process. We will end up with the desired matching tt' in at most 
jfJJ 1 many iterations. □ 

We are ready to present the most important lemma of this section. The following 
lemma shows that there cannot be too many matchings between two given nonsimilar 
lists of linear forms. It is at the heart of our rank bound proof and the reason for the 
logarithmic dependence of the rank on the degree. It can be considered as an algebraic 
generalization of the combinatorial result used by Dvir & Shpilka (Corollary 2.9 of [DS06J). 

Lemma 14. Let U, V be lists of linear forms each of size d > and I±, . . . , I r be orthogonal 
form-ideals such that for all i G [r], there is a matching tti between U,V by Jj. If r > 
(log 2 d + 2) then U, V are similar lists. 

Before giving the proof, let us first put it in the context of our overall approach. In 
the sketch that we gave for k = 3, at each step, we were generating orthogonal matchings 
between two terms. For each orthogonal matchings we got, we colored one linear form 
red (added one form to our basis) and doubled the number of green forms (doubled the 
number of forms in the circuit that are in the span of the basis) . This showed that there 
is a logarithmic-sized basis for all L{C). If we take the contrapositive of this, we get that 
there cannot be too many orthogonal matchings between two (nonsimilar) lists of forms. 
For dealing with larger k, it will be convenient to state things in this way. 

Proof. Let U\ C U be a sublist such that: there exists a sublist V\ C V similar to U\ 
for which U' := U \ U\ and V' := V \V\ are coprime lists. Let U', V' be of size d' . If 
d' = then U, V are indeed similar and we are done already. So assume that d' > 0. By 
the hypothesis and Fact [TBI f° r all i £ [ r ]> there exists a matching tt[ between U, V by ij 
such that: U±, V\ are similar under tt^ and tt[ is a matching between U' , V by Jj. Our 
subsequent argument will only consider the latter property of tt[ for all i G [r]. 
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Intuitively, it is best to think of the various ir^s as bipartite matchings. The graph 
G = (U',V',E) has vertices labelled with the respective form. For each ir^ and each 
I G U', we add an (undirected) edge tagged with Ii between I and There may be 

many tagged edges between a pair of vertices^]. We call vr^(^) the Ij-neighbor of I (and 
vice versa, since the edges are undirected). Abusing notation, we use vertex to refer to a 
form in U' L)V'. We will denote Uj<t h by </;■ 

We will now show that there cannot be more than (log 2 d + 2) such perfect matchings 
in G. The proof is done by following an iterative process that has r phases, one for each 
Jj. This is essentially the coloring process that we described earlier. We maintain a partial 
basis for the forms in U' U V' which will be updated iteratively. This basis is kept in the 
set B. Note that although we only want to span U' U V' , we will use forms in the various 
Ij's for spanning. 

We start with empty B and initialize by adding some I G U' to B. In the ith round, 
we will add all forms in Ii to B. All forms of U' U V in sp({£} U Jj) are now spanned. We 
then proceed to the next round. To introduce some colorful terminology, a green vertex is 
one that is in the set sp(B) (a form in (U' U V) n sp(B)). Here is a nice fact : at the end 
of a round, the number of green vertices in U' and V' are the same. Why? All forms of I\ 
are in B, at the end of any round. Let vertex v be green, so v G sp(B). The Ji-neighbor 
of v is a linear combination of v and I\. Therefore, the neighbor is in sp(B) and is colored 
green. This shows that the number of green vertices in U is equal to the number of those 
in V. 

Let io G [r] be the least index such that {£} , I\ , . . . , Jj are not orthogonal, if it does 
not exist then set iq := r + 1. Now we have the following easy claim. 

Claim 15. The sets {£}, I±, . . . , Ii —i are orthogonal and the sets: 

{^} U Ji , -Zj -)_i, . . . , I r 

are orthogonal. 

Proof of Claim [731 The ideals {£}, Ji, . . . , ij -i are orthogonal by the minimality of iq. 

As I\, . . . , Ji,, are orthogonal but {£}, Ji,. . . , Jj are not orthogonal we deduce that 
{£} G sp(Ji ). Thus, {£} U sp(Ji Q ) = sp(Ji ) which is orthogonal to the sets Ii +i, . . . ,I r 
by the orthogonality of I\ , . . . , I r . □ 

We now show that the green vertices double in at least (r — 2) many rounds. 

Claim 16. For i {l,io}, the number of green vertices doubles in the ith round. 

Proof of Claim [Jh\ Let £' be a green vertex, say in U', at the end of the (i — l)th round 
(B = {£} U Ji-i). Consider the Ij-neighbor of £'. This is in V' and is equal to (c£' + v) 
where c G F* and v is a nonzero element in sp(Ii) (this is because U', V' are coprime). If 
this neighbor is green, then v would be a linear combination of two green forms, implying 
v G sp(B). But by Claim [To] Ii is orthogonal to B, implying v G sp(B) D sp(Ij) = {0} 
which is a contradiction. Therefore, the /j-neighbor of any green vertex is not green. On 
adding Ii to B, all these neighbors will become green. This completes the proof. □ 

5 It can be shown, using the orthogonality of the Ws, that an edge can have at most two distinct tags. 
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We started off with one green vertex £, and U', V each of size d! . This doubling can 
happen at most log 2 d! times, implying that (r — 2) < log 2 d! . 

□ 

Remark 17. The bound of r = log 2 d + 2 is achievable by lists of linear forms inspired by 
Section® Fix an odd s and define: 

U := {(61X1 H h fe s _ix s _i + x s ) I bx, ... , G {0, 1} s.t. b\ -\ h is even} 

V := {{bixi H h 6 s _ix s _i + x s ) | 61, ... , o s _i G {0, 1} s.t. b\ -\ h o s _i zs odd} 

It is easy to see that over rationals, #U = j^V = 2 S ~ 2 and for all i G [s — 1], there is a 
matching between U, V by (xi), furthermore, there is a matching by (xi + - ■ ■ +x s -i + 2x s ). 
Thus there are {log2\U\ + 2) many orthogonal matchings between these nonsimilar U,V; 
showing that our Lemma is tight. 

3.4 Ordered Matchings and Simple Parts of Circuits 

Before we delve into the definitions and proofs, let us motivate them by an intuitive 
explanation. 

3.4.1 Intuition 

Our main goal is to deal with the case k > 3. The overall picture is still the same. We 
keep updating a partial basis S for L(C). This process goes through various rounds, each 
round consisting of iterations. At the end of each round, we obtain a form-ideal / that is 
orthogonal to S. In the first iteration of a round, we start by choosing a form i\ in L(T\) 
that is not in sp(S), and adding it to /. We look at C(mod £\) in the next iteration, 
which is obviously an identity, and try to repeat this step. The top fan-in has gone down 
by at least one, or in other words, some multiplication terms have become identically 
zero (mod £\). We will say that the other terms have survived. The major obstacle to 
proceeding is that our circuit is not simple any more, because there can be common factors 
among multiplication terms modulo £\. Note how this seems to be a difficulty, since it 
appears that our matchings will not give us a proper handle on these common factors. 
Suppose that form v is now a common factor. That means, in every surviving term, there 
is a form that is v modulo £\. So these forms can be £i-matched to each other! We 
have converted the obstacle into some kind of a partial matching, which we can hopefully 
exploit. 

Let us go back to C(mod £\). Let us remove all common factors from this circuit. This 
stripped down identity circuit is the simple part, denoted by sim(Cmod £\). The removed 
portion, called the gcd part, is referred to as gcd(Cmod £±). By the above observation, 
the gcd part has ^i-matchings. A key observation is that all the forms in the gcd part are 
not similar to £\. This is because we were only looking at nonzero terms in C(mod ^1). 
Having (somewhat) dealt with gcd(Cmod £\) by finding /-matchings, let us focus on the 
smaller circuit sim(Cmod £{) 

We try to find an £2 G L(sim(Cmod £±)) that is not in sp(S U {^i}). Suppose we 
can find such an £2. Then, we add £2 to / and proceed to the next iteration. In a 
given iteration, we start with a form-ideal /, and a circuit sim(Cmod I). We find a form 
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£ G L(sim(Cmod I))\sp(S U /). We add £ to I (for convenience, let us set I' = I U {£}) 
and look at the C(mod /'). We now have new terms in the gcd part, which we can match 
through /'-matchings. As we observed earlier, all the terms that have forms in I' are 
removed, so the terms we match here are all nonzero modulo V . We remove the gcd part 
to get sim(Cmod I'), and go to the next iteration with /' as the new /. When does 
this stop? If there is no £ in L(sim(Cmod I))\sp(S U I), then this means that all of 
L(sim(Cmod I)) is in our current span. So we happily stop here with all the matchings 
obtained from the gcd parts. Also, if the fan-in reaches 2, then we can imagine that the 
whole circuit is itself in the gcd portion. At each iteration, the fan-in goes down by at 
least one, so we can have at most (k — 2) iterations in a round, hence the / in any round 
is generated by at most (k — 2) forms. When we finish a round obtaining an ideal I, there 
are some multiplication terms in C that are nonzero modulo / after the gcd parts in the 
various iterations are removed from these terms. These we shall refer to as constituting 
the blocking subset of [k], for that round. 

The way we prove rank bounds is by invoking Lemma [T4"l Each round constructs a 
new orthogonal form ideal. At the end of a round, we have a set S, which is a partial 
basis. If S does not cover all of L(C), then we use the above process (of iterations) to 
generate a form-ideal I orthogonal to S. Consider two terms T a and T& that survive this 
process (mod I). At each stage, when we add a form to /, we remove forms from T a and 
T;,, /-matching them. When we stop with our form-ideal /, we can think of T a and 7& as 
split into two parts : one having forms from sp(SL)I), and the other which is /-matched. 
For each orthogonal form-ideal we generate, we match subsets of terms. We use Lemma HH 
to tell us that we cannot have too many such form-ideals, which leads to the rank bound. 

3.4.2 Definitions 

We start with looking at the particular kind of matchings that we get. Take two terms T a 
and Tj, that survive a round, where we find the form-ideal / generated by {£±,£2, • • • ,£ r }- 
At the end of the first iteration, we add £\ to /. No form in L(T a )L)L(Tb) can be 0(mod £\). 
We match some forms in T a to Tf, via ^i-matchings. They are removed, and then we proceed 
to the next iteration. We now match some forms via sp{{£\,£2}) matchings and none of 
these forms are in this span. So in each iteration, the forms that are matched (and then 
removed) are non-zero mod the partial / obtained by that iteration. We formalize this as 
an ordered matching. 

Definition 18. [Ordered matching] Let U, V be lists of linear forms and an ordered 
set I = {vi, . . . ,Vi} be a form-ideal having i > 1 linearly independent linear forms. A 
matching it between U,V by I is called an ordered /-matching if : 

Let vo be zero. For all £ £ U, ir(£) = (c£ + w) where c £ F*, and w G sp(vo, . . . , Vj) for 
some j satisfying I ^ sp(vo, . . . , Vj). 

We add the zero element Vq, just to deal with similar forms in U and V. Note that the 
inverse bijection 7r _1 is also an ordered matching between V, U by /. It is also easy to see 
that if 7Ti and TT2 are ordered matchings between lists U\,V\ and lists U2, V2 respectively 
by the same ordered form-ideal / then their disjoint union, tq U-7T2, is an ordered matching 
between lists U x U U 2 , Vi U V 2 by /. 

We will stick to the notation in Definition [T51 For convenience, let spj := sp(vo, • • • , Vj). 
Let ir(£) = d£ + w, where w G spj but £ g" spj then the constant d is unique. If there were 
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two such different constants, say d and d', then both (tt(£) — d£) and (n(£) — d'£) would be 
in spj implying that (d — d')£ £ spj. That contradicts £ spj. Thus for a fixed £ and an 
ordered matching n, d is uniquely determined. Keeping the notation above, we can well 
define : 

Definition 19. [Scaling factor] The scaling factor of an ordered matching tt between 
U and V is denoted by sc(tt). For each £ G U, let de be the unique constant such that 
ir(£) = di£ + w, where w 6 spj but £ spj. Then sc(ir) := n^gc/ F° r empty U, sc(n) 
is set to be 1. 

Definition 20. [Subcircuits and regular circuits] For non-empty Q C [k], the sub- 
circuit Cq of a Y,HY,(k,d) circuit C is the sum ^2j G gTj. For a form-ideal I we call Cq 
regular mod I if Vg 6 Q, T q ^ [mod I). We will denote the constant factor in the 
multiplication term T q by a q £ ¥* , thus T q = a q M(L(T q )). 

We are now ready to define the gcd and sim parts of a subcircuit. Although the ideas 
are quite simple and intuitive, we have to be careful in dealing with constant factors. 
Much of this notation has been introduced for rigorous definitions. Take a subcircuit Cq 
that is regular mod / as well as an identity mod /. A maximal list of forms, say U, that 
divides T q , for all q 6 Q, is called the gcd of Cg(mod /). In every T q , there is a list U q of 
forms that are /-similar to U. Therefore, we have 7-matchings between U and U q . This 
is the gcd data of Cq modulo I, and represents that various matchings that we will later 
exploit. If we remove U q from each T q , then (by accounting for constants carefully) we get 
a simple (mod I) identity, the sim part of Cg(mod /). We formalize this below. 

Let Cq be regular modulo /. Fix a q\ in Q. Let U be a maximal sublist of L(T qi ) such 
that M{U) divides T q modulo I for all q € Q. Since R/I is isomorphic to a polynomial 
ring, the nonconstant polynomials in R/I satisfy unique factorization property, i.e. any 
polynomial in R that is nonconstant modulo / uniquely factors modulo the ideal (J) into 
polynomials irreducible modulo /. Since Cq is regular modulo / and U C L(T qi ) is a 
maximal list such that \/q G Q, M{U) \ T^mod I): 

• M(U) is a gcd of the polynomials {T q \ q 6 Q} modulo the ideal (I). 

• For all q 6 Q, there exists a sublist U q C L(T q ) and a c q € F* such that M(U q ) = 
c q ■ M(U) (mod I). By unique factorization in R/I and regularity of Cq mod I this 
gives an ordered matching -K q between U, U q by /. Also, by the definition of scaling 
factor of a matching, ir q satisfies: \/q G Q, M(U q ) = sc{-K q ) ■ M{U) (mod I). 

Note that given Cq and I there are many possibilities to choose the lists U and {U q \ q £ Q} 
but they are all uniquely determined upto similarity modulo the ideal (I) and that will be 
good enough for our purposes. So we choose them in some way, say the lexicographically 
smallest one unless specified otherwise, and define the gcd data. Using the gcd data of 
Cq mod / we can extract out a smaller circuit from Cq which we call the simple part. 

Definition 21. [gcd and sim parts] The gcd data of Cq modulo I is the following set 
°f #Q matchings: 

^c~d(C Q modI):={(TT q ,U,U q )\q£Q} (1) 
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The gcd of C<5(mod I) is just gcd{Cqmod I) := M(U). The simple part of Cq mod / is 
the circuit: 

sim{C Q mod I) := ^ sc{it q )a q ■ M(L(T q ) \ U q ) 
<?e<2 

Before a round, we have a partial basis S. At the end of a round, we produce a 
form-ideal / that is orthogonal to S. We call this a useful ideal. Let Q C [A;] be such 
that all T q , q G Q survive (mod I). This is called the blocking subset. For each such q, 
there are a list of forms V q C L(T q ) that are mutually matched via ordered /-matchings 
(these are really a collection of gcd datas). This is called the matching data. Even after 
we remove V q from each term T q (carefully accounting for constants, as explained above), 
we still have an identity mod I. All forms of this identity are in sp(S U I) \ sp(I), since we 
assume that the round has ended. Furthermore by rearranging linear forms, all V^'s can 
be made disjoint to sp(S 'U I)\sp(I) . Therefore this round partitions the L(T q ) into V q and 
L(T q ) n (sp(S U I)\sp(I)) (for all q G Q). These end-of-a-round properties are formalized 
by the following definition. 

Definition 22. [Useful ideals, blocking subsets, and matching data] Let C = 

J2j<k^j' Tj = ctj M(L(Tj)). The set S C L{R) and I is an ordered form-ideal orthogonal 
to S. We call I useful in C wrt S if3Q C [A;], 1 < #Q < k with the following properties : 
For all qeQ, let V q be L{T q )\(sp(S U I) \ sp(I)). (Therefore, L{T q ) \V q C sp(S U J) \ 
sp(I).) 

• There exists a list of linear forms V such that for all q G Q, there is an ordered 
I -matching r q between V,V q . 

• The circuit X^eQ sc ( T q) a q " M(L(T q ) \ V q ) is a regular identity modulo I. 

Such a Q we call a blocking subset ofC, S, I. By matching data ofC, S, I, Q we will mean 
the set: 

mdata(C, S, I, Q) := {(r„ V, V q ) \ q G Q} 
We will call mdata(C, S, I, Q) trivial if the lists V q , q G Q, are all mutually similar. 

From the matching data, we will exploit the fact that for each pair q\,q2 G Q, there 
is an ordered /-matching between V qi and V q2 . Nonetheless, we will represent these #Q 
matchings via V because it will be more convenient to deal with the intermediate gcd 
parts while we are building /. 

3.4.3 Basic facts 

In this subsection, we prove some basic facts about ordered matchings, scaling factors and 
gcd and sim parts of a circuit. These facts are not difficult to prove, but it will be helpful 
later to have them. 

The following two properties are immediate from the definition of scaling factor. 

Fact 23. Let m and iT2 be ordered I-matchings between lists U\,Vi and lists U2,V2 re- 
spectively. Then sc(7r{" 1 ) = sc^i) -1 and sc(tt\ U 7^) = sc(iri) ■ sc(iT2). 

Thus, ordered matchings have inverses, have a union and the following fact shows that 
they can also be composed. 
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Fact 24. Let tt\ and TT2 be ordered matchings between U±,V and V,U% respectively by the 
same ordered form-ideal I = {v±, . . . ,Vi}. Then the naturally defined composite matching 
7T2vri is also an ordered matching between U\,U2 by I. Furthermore, sc(tt2TT\) = sc(tt\) ■ 

SC(7T 2 ). 

Proof. Consider a linear form I G U\. There exists c\ G F* and a\ G spj Y ,£ ^ spj 1 such 
that iti(£) = c\£ + a\. Also, there exists C2 G F* and cti G spj 2 , iri(£) ^ spj 2 such that 
7T2{tti(£)) = C2{cii+a\) + a2- Let j = max{ji, j'2}. Obviously, (c2«i+a2) G spj. If £ G spj 
then as £ G" sp^ we deduce that j = ji > ji, thus £ G spj 2 , implying iri(£) = c\£+ai G spj 2 , 
which is a contradiction. Therefore, £ ^ sp^. This proves that the composite bijection 
7T27T1 is an ordered matching. 

The contribution from the image of £ G U\ to sc{ti2'^i) is C1C2 while the corresponding 
contributions of £ G U\ to sc(7Ti) is c\ and of 7Ti(^) G F to sc{tt2) was C2. Thus, sc(tt2TT\) = 
sc[tt\) ■ sc(tt2). □ 

The scaling factor nicely characterizes the ratio of M(U) and M(V) when U, V are 
similar. 

Fact 25. Let ir be an ordered matching between lists U, V of linear forms, by an ordered 
form-ideal L = {v\, . . . ,v{\. If it is trivial then M(V) = sc(ir) ■ M \U) . Thus all the ordered 
matchings, between a given pair of similar lists, have the same scaling factor. 

Proof. The proof idea is identical to the one seen in Fact [I3j 

Let £ G U be such that ir(£) = d£ + v is not similar to £, where d G F*, v G spj and 
£ $l spj. Since V is similar to U there exists a form equal to c£ in V, for some c G F*. As 
it is an ordered matching, it must be mapping some £' G U to c£ in V, satisfying: tt(£') = 
d'£' + v 1 = c£, where d' £¥*, v' £ spj>, and £' spj>. 

Now we define a new matching n by flipping the images of £ and £' under it, i.e., define 

7r to be the same as n on U\{£,£'} and: tt(£) := tt(£') and tt(£') := vr(£). The matching n is 
an ordered matching because: tt(£) = c£ for c G F* and more importantly tt(£') = d£ + v = 
d (ll±vL) +v= (<^)f + + „). Let j* := max{j,/}. Obviously, + v) G sp r . If 
j* = j', we are done, because we already know that £' spf. If j* = j and £' G spj, then 
c£ = d'£' + i/ is in spj (contradiction) . 

We have obtained now an ordered matching tt between U, V by / where the number of 
forms mapped to a similar form has strictly increased. Observe that sc(n) had a unique 
contribution of d, d' from the images of £, £' respectively while sc(tt) has a corresponding 
contribution of c, (— ). On all the other elements of U, tt is the same as tt. Thus, we have 
that sc(tt) = sc(ir). 

The above process will yield an ordered matching tt' in at most jflJ many iterations, 
such that U, V are similar under tt' and sc(tt') = sc(tt). But this means that, for all £ G U, 
tt'{£) = X£, for some A G F*. By definition the contribution by £ to sc(tt') would be then 
A. This clearly implies that M(V) = sc(vr') • M(U) and finally M(V) = sc(vr) • M(U). □ 

We move on to facts about the gcd and sim parts of a circuit. 

Fact 26. // Cq is a regular mod I subcircuit of C then: 

Cq = gcd{CQmod I) ■ sim(CQmod I) (mod I) 
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Additionally, if Cq is an identity modulo I then sim(CQmod I) is a simple identity modulo 
I. 

Proof. Recall that Cq = Ylq^Q Tq and the gcd data gcd(CQmod I) is {(n q , U, U q ) \ q G Q}. 
Now T q = a q M{U q ) ■ M(L(T q ) \ U q ) and M(U q ) = sc(^ q ) ■ M(U) (mod /), where M(U) is 
gcd{CQm.od I). Thus, 

Cq = Yl a q sc(TT q )M(U) ■ M(L(T q ) \ U q ) (mod I) 
q£Q 

= gcd(CQmod I) ■ sim{CQiaod I) (mod /) 

This proves the first part. Assume now that Cq = 0(mod /) which means sim(CQmod I) = 
0(mod /). If it is not a simple identity mod /, then there is an £' £ L(sim(CQmod I)) 
such that, Mq £ Q, £' \ M(L(T q ) \ U q ) mod /. Then, M{U) cannot be the gcd of the 
polynomials {T q \ q £ Q} modulo the ideal (I) (contradiction). □ 

When / = {0} we write gcd(CQ), gcd(CQ) and sim(CQ) instead of gcd(CQmod I), 
gcd(CQmod I) and sim(CQmod I) respectively. We collect here some properties of sim(Cg) 
that would be directly useful in our rank bound proof. 

Fact 27. Let £ G L(R)* and Cq be a subcircuit of C. Then #simi(£, L(sim(Cg))) > 
iff3qi,q2 £ Q such that #simi(£, L(T qi )) ^ #simi(£, L(T q2 )) . 

Proof. Note that #simi(£, L(T q )) is the highest power of £ that divides T q . Thus, if 
#simi(£, L(T q )) is the same, say r, for all q £ Q then the highest power of £ dividing 
gcd{Co) is also r implying that for all q £ Q, the polynomial gc< i(c Q ) ls coprime to £. By 
definition of the simple part of Cq this means that #simi(£, L(sim{CQ))) = 0. 

Conversely, if for an £ £ L(R)*, 3q±,q2 £ Q such that #simi(£, L(T qi )) > #simi(£, 
L(T q2 )) then it is easy to see that gc d{CQ) cann °t be coprime to £. This implies that 
#simi(£,L(sim(C Q ))) > 0. ' □ 

Fact 28. Let S C L(R) and Q2 Q Qi C [kj. If L{sim{CQ 1 )) has all its linear forms in 
sp(S), then all the linear forms in L{sim(CQ 2 )) are also in sp{S). 

Proof. For an arbitrary £ £ L(sim(CQ 2 )), by Fact [271 there are qi,q2 £ Q2 such that 
#simi(£, L(T qi )) ^ #simi(£, L(T q2 )). As (71,(72 G Qi, we can again apply Fact l27l to 
deduce that #simi(£, L(sim(CQ 1 ))) > 0. Therefore £ £ sp(S). □ 

Fact 29. Let S C L(R) and Qi,Q 2 Q [k] such that Qi n Q 2 ^ (f>. If L(sim(C Ql )) and 
L{sim{CQ 2 )) have all their linear forms in sp{S) then all the linear forms in L(sim(CQ 1 uQ 2 )) 
are also in sp(S). 

Proof. Take go £ Qi n Q2 and an arbitrary £ £ L(sim(CQ 1 \ J Q 2 )) . By Fact ETJ there are 
Qi,Q2 G Qi U Q2 such that #simi(£, L(T qi )) ^ #simi(£, L{T q2 )) . 

If (71,(72 are in the same set (wlog, in Qi), then Fact [27] tells us that #simi(£, 
L(sim(CQ 1 ))) > 0, trivially implying that £ £ sp(S). Now assume wlog that q± £ Qi, C72 G 
Q 2 . For some i £ {1,2}, #simi(£, L(T qo )) / #simi(£, L(T qi )). Therefore, by Fact ETJ 
££sp{S). ' □ 
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3.5 Getting Useful Form-ideals 

Given a set S that does not span all of L(C), we can find a form-ideal that is useful wrt S. 
As we mentioned earlier, in a round we start with S, and end up with a useful I through 
various iterations. We will formally describe this process below. 

An iteration starts with a partial /, and a simple regular identity E in the ring R/I, 
which has multiplication terms with indices in [k]. At least one of the forms in E is not 
in sp(S U I). At the beginning of the first iteration, E is set to C and I is {0}. 



A SINGLE ITERATION 

1. Let 1 be a form in E that is not in sp(S U I). 

2. Add I to I. 

3. Consider E modulo I and let Q be the subset of indices of nonzero multiplication 
terms. 

4. Let U be the gcd of E(mod I), and let the gcd data be gcd = {(ir q , U, U q ) \ q G Q}. 

5. If the fanin, \Q\, of E(mod I) is 2, stop the round. 

6. If all forms in sim(E(mod I)) are contained in sp(SL)I), stop the round. Otherwise, 
set E to be sim(E(mod I)) and go to the next iteration. 

Lemma 30. Let C be a simple SIIS(/c, d) identity in R. Suppose S C L(R) and L(C)\sp(S) 
is non-empty. Then there is a form-ideal I useful in C wrt S. 

Proof. As discussed before in the intuition, we generate I in one round and the proof will 
be done by induction on the number of iterations in this round. For convenience, we set 
the end of the zero iteration to be the beginning of the round. We will prove the following 
claim: 

Claim 31. Consider the end of some iteration. There exists a list V of forms such that : 
for all q in the current Q, there is a list V q C L(T q ) that has an ordered I-matching to V . 
Furthermore, M(L(T q )\V q ) is similar to the term indexed by q in sim(E(mod I)). 

Proof of Claim [31\ This is proven by induction on the iterations. At the end of the zero 
iteration, E is just C and / = {0}. By the simplicity of C, sim(E(mod I)) is just C, and 
Q = [k]. So all the V^'s can be taken just empty. 

Now, suppose that at the end of the ith iteration, we have an ordered /-matching from 
V q to V for all q in the current Q. In the {i + l)th iteration we will denote by /' the set 
I U {£}, E' = sim(E(mod I)), and Q' C Q the subset of indices of non-zero terms in E' 
modulo I'. For a q E Q' , we have a list V q C L(T q ) and an ordered I-matching r q between 
V, V q . All forms of T q not in V q are in E'. Now consider the /'-matching n q between U, U q 
obtained in this iteration. No forms in these can be in sp(I'), since U is gcd(E' (mod I')) 
and q G Q' . Therefore, n q is an ordered matching. We can take the disjoint union of these 
matchings to get an ordered /'-matching r q U ir q between V UU and V q U U q . All forms in 
L(Tg)\(Yq U U q ) are in the qth term of sim(E' (mod /')). This completes the proof of the 
claim. □ 
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The number of iterations in a round is at most (A; — 2). This is because after each 
iteration, the fanin of the circuit E goes down by at least 1. Therefore, there must be a 
last iteration (signifying the end of the round). Consider the end of the last iteration. If 
the fanin \Q\ of E^mod /) is 2, then by unique factorization, sim(E(mod I)) is empty. So, 
all the forms in sim(E(mod I)) are in sp(S U I), at the end of a round. By the previous 
claim, there is a list V such that for every surviving q £ Q, there is a sublist V q C L(T q ) 
and an ordered /-matching r q between V and V q . By Fact [26l we have that -E(mod I) is 
„ e Q sc(T q )a q ■ M(L(T q ) \ V q ) and is an identity (in R/I). 

Let V q := V q \ (sp(S U I)\sp(I)) (similarly, define V). Note that T q induces a matching 
r' q between V' and V' q . Furthermore, ^2 q€ n sc(T q )a q ■ M(L(T q ) \ Vq) is a multiple of 
-E(mod /) and is regular (each term in the above sum is non-zero mod I). Thus, form- 
ideal / is useful in C wrt S. □ 

To prove a rank bound for minimal and simple T,HT,(k,d) identity C, our plan is to 
start with S = <f> and expand it round-by-round by adding the forms of a form-ideal, useful 
in C wrt S, to the current S. Trivially, such a process has to stop in at most kd iterations 
(over all rounds) but we intend to show that it actually ends up, covering all the forms 
in L(C), in a much faster way. To formalize this process we need the notion of a chain of 
form-ideals. This is just a concise representation of the matchings that we get from the 
various rounds. 

Definition 32. [Chain of form-ideals] Let C be a £!!£(£;, cf) circuit. We define a 
chain of form-ideals for C to be the ordered set T := {(C, Si, Ii, Qi), ■ ■ ■ , (C, S m ,I m , Q m )} 
where, 

• For all i £ [m], Si C L(R), Ii is a form-ideal orthogonal to Si and Qi C [k] . 

• S\ = (f) and for all 2 < i < m, Si = U 

• For all i £ [m], Ii is useful in C wrt Si. 

• For all i £ [m], Qi is a blocking subset of C, Si, Ii. 

We will use sp(T) to mean sp(S m U I m ) and #T to denote m, the length of T. The chain 
T is maximal if L(C) C sp{T). 

Note that by Lemma [HUj if a chain T of length m is not maximal, then we can find a 
form-ideal I m +i that is useful wrt S m U I m . This allows us to add a new (C, S m -i-i, I m -^i, 
Q m +i) to this chain. It is easy to construct a maximal chain for C, and the length of this 
can be used to bound the rank: 

Fact 33. Let C be a simple T l IlT l (k,d) identity. Then there exists a maximal chain of 
form-ideals T for C. The rank of C is at most (k — 2)(#T). 

Proof. We start with S± = 4> and an £ £ L(C). By Lemma [301 there is a form-ideal I\ 
(containing £) useful in C wrt Si with blocking subset, say, Q\. So we have a chain of 
form-ideals {(C, Si, Ii, Qi)} to start with. Now if L(C) has all its elements in sp(Si U Ii) 
then the chain cannot be extended any further and we are done. Otherwise, we can again 
apply Lemma [301 to get a form-ideal I2 useful in C wrt S2 '■= Si Uii with blocking subset, 
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say, Q2- Thus, we have a longer chain of form-ideals {(C, Si, Ji, Ji), (C, 52, 12, J2)} now. 
We keep repeating till we have a chain of length m where J(C) C sp(*S* m U Jm)* 

Note that 5 m U J m = Ui< m ^m- Each Jj is generated by at most (k — 2) forms, so there 
is a basis for L(C) having at most (k — 2)m forms. □ 

We come to a stronger version of the main theorem of this paper. 

Theorem 34. If C is a simple and minimal SIIE(fc, d) identity then the length of any 
maximal chain of form-ideals for C is at most ( 2 )(log 2 d + 3) + {k — 1). 

This theorem with Fact 1331 imply the main result, Theorem El We prove this theorem 
in the next section. 

3.6 Counting all Matchings: Proof of Theorem [341 

Let a maximal chain of form-ideals T for C be {(C, Si, I\, J\), . . ., [C, S m , I m ,J m )}. We 
will partition the elements of the chain into three types according to properties of the 
matchings that they represent. Each of these types will be counted separately. 

We first set some notation before explaining the different types. Let the m matchings 
data be: 

mdata(C,Si,Ii,Qi) =: {(Ti >q ,Vi,V i>q ) | q G Qi} 

We will use mdatai as shorthand for the above. For all q G Qi, Vi jq is a sublist of L(T q ) 
and Ti ;q is an ordered matching between Vi,Vi^ q by /«. By the definition of useful-ness 
of form-ideal Ij we have that Vi >q is disjoint to sp(Si U Jj) \ sp(Jj). Thus, Vi iq can be 
partitioned into two sublists: 

V i>q , := (£ G Vi, q I £ G sp(Ii)) , and 
*W := {£ G V*,, I ^ S p(5i U /<)) • 

and analogously V$ can be partitioned into two sublists 1^,0 and V^^. It is easy to see that 
these partitions induce a corresponding partition of Ti >q as Ti >q fl U Ti i9) i, where Tj jgi o (and 
is an ordered matching between ^0, Vi,q,o (and V^,i, V^ ig> i) by Jj. 
Here are the three types of mdatai's: 

1. [Type 1] There exist (71,(72 G Qi such that Vi )9li i is not similar to Vi m ,\- 

2. [Type 2] There exist (71,(72 G Qi such that is not similar to Vi m , but for all 
ri,r2 G Qi, Vi trit i and Vi, r2 ,l are similar. 

3. [Type 3] For all q±, qi G Qj, Vi qi is similar to Vi ?2 . In other words, mdatai is trivial. 

We partition [m] into sets N%,N2, N3, which are the index sets for the mdata of types 
1, 2, 3 respectively. 
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3.6.1 Bounding #N\ and #N 2 

The dominant term in Theorem 1341 comes from #N\. If #Ni is large, then by an averaging 
argument, for some pair (a, b), we find many matchings between forms in T a and These 
are all orthogonal matchings, but are defined on different sublists of L{T a ) and L(Tj,). 
Nonetheless, we can find two dissimilar lists that are matched too many times. Invoking 
Lemma [13] gives us the required bound. 

Lemma 35. #iVi < g) (log 2 d + 2). 

Proof. For the sake of contradiction, let us assume jf=Ni > ( 2 ) (log 2 d+2). For each mdatai 
(i G iVi), choose an unordered pair of indices Pi = {91,92} such that Vf i9li i and Pt l92 ,i are 
not similar. As there can be only („) distinct pairs, we get by an averaging argument that, 
s > (log 2 d + 2) of the Pj's are equal. Let P^ = • • • = Pj s = {a, 6} for ii < • • • < i s G N±. 
Now we will focus our attention solely on the ordered matchings /Xj := \T^~ a \ between 
^,a,i> ^1,6,1 by Ij, for all i G {ii, . . . , i s }. The source of contradiction is the fact that all 
these matchings are also well defined on the 'last' pair of sublists V^ ajCl5 i, T^ S! 6,l : 

Claim 36. For all i G . . . ,i s }> l*>i induces an ordered matching between V£ a i, Vi j, 1 
by h- 

Proof of Claim \36l The claim is true for i = i s so let i < i s . The matching /ij is an 
ordered Ij-matching between V^^i, Vi,6,i- For £ G K s ,a,i 5 ^ sp(Si a U ii s ). Since i < i s 
and L(T a ) \ Vi >a> i C sp(Sj U 1$), £ cannot be in L(T a ) \ V^ )0) i. Therefore, £ is in Vi >a) i. So 
/Xj maps £ to some element in V^i, showing /Xj is defined on the domain Vi 3i a,i- 

So we know /ij maps £ G ^i s ,a,l to an element /ij(^) G V^i. As /x, is an Jj-matching, 
A*i(^) = (c£ + a) for some c G F* and a G sp(Ii) C sp(Ii g ), thus /Xi(^) sp(Si 3 Uij s ) (recall 
£ sp(Si s U ij s )). Thus cannot be in L(T&) \ V^^^i (which has all its elements in 

sp(S is U As to begin with ^(i) G L(T b ) we get that p,i(£) G V is)b ^. 

Thus, /ij maps an arbitrary £ G V£ S)0) i to p-i(l) G V$ In other words, /ij induces an 
ordered matching between Vi ai0) i 3 Vi s ,&,i by /«. □ 

This claim means that there are 5 ^> (log 2 d -\- 2) bipartite matchings between V^ s?(li i, 
Vi Si b ) i by orthogonal form-ideals J^, . . . , Ij s respectively. Lemma HH implies that the lists 
^i s ,o,lj ^i s ,6,i are similar. This contradicts the definition of Pj s . Thus, #A"i < ( 2 )(log 2 d + 
2).' ' □ 

For dealing with #-/V 2 , we use a slightly different argument to get a better bound. We 
show that a Type 2 matching can involve a pair of terms at most once. 

Lemma 37. #iV 2 < Q . 

Proof. For the sake of contradiction, assume #-/V 2 > ( 2 ). For each mdatai (i G iV 2 ), let 
Pj be an unordered pair (91,92) such that Vi m is not similar to Vi m - Note that because 
^1,31,1 is similar to Vi g 2 i, it must be that Vi tqit o is not similar to V^ g2j o- By the pigeon-hole 
principle, at least two Pj's are the same. Suppose P^ = Pj 2 = {a, 6} for i\ < i 2 G iV 2 . 

Let £ G Vi 2) a,o then by the definition of Vi 2iai o we have that i G sp(Ij 2 ). This coupled 
with ii < i 2 means that £ cannot be in L(T a ) \ Vi ia i (which has all its elements in 
sp(S h U IjJ). As to begin with £ G L{T a ) we get that I G V* lj0) i. Thus, Fj 2 ,a,o (V~i 2 ,b,o) is 
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a sublist of Vi 1>a> i (^u,b.i)- From the useful-ness of Ij 2! the sublist Vi 2<a fi (V^^o) collects 
all the linear forms in L(T a ) (L(T&)) that are in sp{Ii 2 ) while from the useful-ness of 
the sublist L(T a ) \ V^ 1)0) i (L(Tb) \ ^5,1) is disjoint from sp(Ii 2 ). Thus, the sublist Vi 2A $ 
(Vi 2) b,o) collects all the linear forms in Vi lA ,i (Vf^i) that are in sp(Ii 2 ). This together 
with the similarity of Vi ltCl; i and V^^i gives us (by Fact [TO]) that Vi 2A $ and V^^o are 
similar, which contradicts the way Pi 2 = {a, b} was defined. Thus, #iV 2 < © • □ 

3.6.2 Bounding #N 3 

This requires a different argument than the pigeon-hole ideas used for #iVi and # . We 
divide these type 3 matchings further into internal and external ones. Our final aim is to 
prove : 

Lemma 38. #A 3 < (k - 1) 

We shall use a combinatorial picture of how the chain of form-ideals connects the 
various multiplication terms through matchings. We will describe an evolving forest J- 
and only deal with Type 3 mdatai. 

Initially, the forest J- consists of k isolated vertices, each representing the k terms 
Ti, • • • , Tfc. We process each mdatai in increasing order of the i's, and update the forest 
T accordingly. We will refer to this as adding mdatai to J- ' . At any intermediate state, 
the forest T will be a collection of rooted trees with a total of k leaves. 

Definition 39. Consider T when mdatai is processed. If all of Qi belongs to a single tree 
in T , then mdatai is called internal. Otherwise, it is called external. 

If mdatai is internal, T remains unchanged. While each time we encounter an external 
mdatai, we update the forest T as follows. We create a new root node labelled with mdatai 
(abusing notation, we refer to mdatai as a node), and for any tree of T that contains a 
T q , q 6 Qi, we make the root of this tree a child of mdatai. 

Fact 40. The total number of external matchings is at most (k — 1). 

Proof. Note that each external mdatai reduces the number of trees in the forest T by at 
least one. As initially T has k trees and at every point of the process it will have at least 
one tree, we get the claim. □ 

It remains to count the number of internal matchings. Whenever we encounter an 
internal mdatai, we can always associate it with some root mdatai' of T such that i' < i 
and all of Qi is in the tree rooted at mdatai' ■ 

Lemma 41. If mdatai is internal, then the subcircuit CQ i is identically zero in R. There- 
fore, by the minimality of C, no mdatai can be internal. 

This lemma with the previous fact immediately imply that j^N^ < (k — 1). We now set 
the stage to prove this lemma. Take any Type 3 mdatai. By the triviality of mdatai, the 
lists in {Vi t q I q £ Qi} are mutually similar. By the useful-ness of Ii the lists in {L(T q )\ 
Vi,q I 1 S Qi} have all their forms in sp(Si U Ij) \ sp{I{). Furthermore, Di := X^geQ; 
sc(Ti^ q )a q M{L{T q ) \ Vi t q) is a regular identity modulo Ii. Our aim is to remove the forms 
in Di which are common factors (not mod Ii, but mod 0). This gives us a new circuit 
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(quite naturally, that will turn out to be sim^Q^) that is still an identity (mod Ii). 
In other words, start with the subcircuit Cq 4 , and remove all common factors from this 
subcircuit. This is expected to be both sim(Cg i ) and an identity mod Ii. 

Using this we will actually show that if mdatai is internal then sim^Q^ is an identity 
(mod 0). Then we can multiply the common factors back, and Cq 1 would be an absolute 
identity (violating minimality of C). We proceed to show this rigorously. We have to 
carefully deal with field constants to ensure that sim(Cg i ) is indeed a factor of Di. 

Claim 42. For Type 3 mdatai, the circuit sim{CQ { ) is an identity mod Ii and has all its 
forms in sp(S{ U Ii). 

Proof. Let the gcd data of Di be: 

gcd(Di) := {(Tr it q, Ui, U i; q) | q £ Qi} 

where Ui >q is a sublist of L(T q ) \ Vi^ q and 7rj j9 is an ordered matching between Ui,Ui tq by 
{0}. Note that this is not mod Ii, even though Di is an identity only mod I{. 
By Facts l23l and l26l we can 'stitch' IPs and V's to get: 

• r[ q := Ti >q U 7Tj. 9 is an ordered matching between V( := V% U Ui, V( q := Vi >q U t7j )9 by 
I-. 

• D[ := Y^q&Q, sc ( T i,q) a q M ( L ( T q) \ v l, q )^ is a regular identity modulo h. 

Let q m be the minimum element in Qi. We have that r[ q T-~^ n is an ordered /^-matching 
between the similar lists V( qm , V( q . By Fact [231 we can construct an ordered matching 
between V^, V( q by {0}, with scaling factor equal to sc{r' iq r'r^) = sc{T' iq )/ sc^J. 

The way D[ is constructed it is clear that D\ is a simple circuit. This combined with 
the similarity of V( , V- under implies that the following set of #Qi matchings: 

{(^Kqm'Kq) I ?6Qi} 

is a gcd data of Cq i modulo (0) and the corresponding simple part is: 

<C Qi ) = Y, sc{^ q )a q M{L{T q ) \ VQ 



svm{ 

q£Qi 



sc(t'- 



D> 



sc(t! ) 



Thus, sim^Qj is a regular identity mod Ii as well. Also, by the useful-ness of 
sim^CQ^ has all its forms in sp(Si U Ii). This completes the proof. □ 

We now use the structure of T to show relationships between the various connected 
terms. 

Claim 43. At some stage, let mdatai be a root node of J 7 . Let X be a subset of the leaves 
of mdatai. Then L(sim(Cx)) is a subset of sp(Si U Jj). 
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Proof. Let the indices of all the external Type 3 mdata be (in order) i\, i%, ■ ■ ■ . We prove 
the claim by induction on the order in which T is processed. For the base case, let i := i\. 
Consider J 7 just after mdatai is added. The leaves of mdatai are all in Qi. By Claim [4*21 
L(sim(C Qi )) C sp(SiUli). Any X is a subset of Qi. By Fact 1281 L{sim(C x )) C sp(SiUli). 

For the induction step, consider an external mdatai. When this is processed, a series 
of trees rooted at mdata j 1 , mdata j 2 , ■ ■ ■ will be made children of mdatai. Every j r is less 
than i. Let Y r denote the leaves of the tree mdata j r . Note that Y r n Qi ^ (p. By the 
induction hypothesis, L(sim(CY r )) is a subset of sp(Sj r U Ij r ) (c sp(Si U /«)). Let Z\ 
be Qi U Y\. By Fact [29] applied to smi(CVi) and sim^Q^, we have that L(sim(Cz 1 )) 
is in U /,)). Let Z2 be Z\ U Yjj. We can apply the same argument to show that 

L(sim(Cz 2 )) is in sp(Si U With repeated applications, we get that for Z = \J r Y r , 
L(sim(Cz)) C sp(SiUli)). Note that Z is the set of all leaves of the tree rooted at mdatai. 
By Fact [281 L(C X ) C sp(Si U I;), completing the proof. □ 

We are finally armed with all the tools to prove Lemma [HI 

Proof, (of Lemma [4Tj) Consider some internal mdatai . All the elements of Qi are leaves 
in the tree rooted at some mdataj, for j < i. By Claim |4*3| L(sim(CQ i )) C sp(Sj U Ij). 
But by Claim H21 sim^Q^ = (mod Ij). Since Ii is orthogonal to sp(5j U Ij), Fact QT] 
tells us that sim(CQj is an identity (mod 0). Therefore, CQ i is an identity. □ 

3.7 Factors of a SLTE(A;, d) Circuit: Proof of Theorem [5] 

The ideal matching technique is quite robust and can be used to prove Theorem [5j Let C 
be a simple, minimal, nonzero circuit with top fanin k and degree d (so the different terms 
may have different degrees) that computes a polynomial p(xi,--- ,x n ). We remind the 
reader of the definition of L(p). Let us factorize p into 7T i qi, where each qi is irreducible. 
Then L(p) denotes the set of linear factors of p (that is, qi E L(p) if % is linear). 

For any q £ L(p), C = (mod q), therefore we can generate a form- ideal useful in C 
involving q. Using these we can create a chain of form-ideals whose span contains L(p), 
and all our counting lemmas for the matchings of types 1, 2, 3 will follow. As a result, we 
get a bound of (A; 3 log d) on the rank of L{p). 

4 Concluding Remarks 

It would be very interesting to leverage the matching technique to design identity testing 
algorithms. By unique factorization, matchings can be easily detected in polynomial 
time, and it is also not hard to search for /-matchings involving a specific set of forms in 
/. We prove that depth-3 identities exhibit structural properties described by the ideal 
matchings. Can we reverse these theorems? In other words, can we show that certain 
collections of matchings are present iff C is an identity? This would lead to a polynomial 
time identity tester for all depth-3 circuits. 

There is still a gap between our upper bound for the rank of 0{k i logci) and the lower 
bound of Q(k\ogd). We feel that £;log<i is the right answer and a more careful analysis 
of the matchings could prove this. More interestingly, it is conjectured that when the 
characteristic of the base field is 0, the rank is 0(k), independent of d. We believe that an 
adapation of our matching techniques to characteristic fields could lead to such a bound. 
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